Automatic transcription of intonation using an identified prosodic alphabet
نویسنده
چکیده
A solution is proposed for rapidly adapting prosodic models to a new voice or a new application. First, a prosodic alphabet that is supported by linguistic knowledge is identified at the acoustic level. The observation of the realisation of prosodic events on the acoustic corpus allows classes of breaks, F0 shapes and accents to be constructed and automatic transcription rules to be written. Then the transcribed corpus is used in the estimation of the parameters of a prosodic model for French. The good F0 contours and duration generated with the prosodic model verify the agreement of the identified alphabets to describe prosodic phenomena. Finally, the prosodic model is integrated in the CNET standard French Text-to-Speech Synthesis system. The quality of the generated prosody is considered by naïve listeners as equivalent to the handcrafted system. This result verifies the appropriateness of the alphabet as prosodic descriptors.
منابع مشابه
Levels of representation and levels of analysis for the description of intonation systems
It is argued that a satisfactory global theory of intonation will require four levels of analysis : (i) physical (acoustic, physiological) (ii) phonetic (iii) surface phonological and (iv) deep phonological. The theoretical and cognitive status of each level is discussed and specific proposals are made for a model respecting such an overall architecture as well as a condition of interpretabilit...
متن کاملSpeech Analysis for Automatic Evaluation of Shadowing
This paper presents acoustic analysis for the purpose of automatic evaluation of shadowing speech. We use selfchecked scores of understanding, manual prosodic scores, and TOEIC scores as reference scores of learners’ shadowing speech, and compare these scores with automatic scores based on acoustic features that can reflect phoneme intelligibility and prosodic fluency in terms of intonation, an...
متن کاملUnit Selection Speech Synthesis Using Phonetic-Prosodic Description of Speech Databases
This paper describes an approach to speech synthesis based on using speech databases at different stages of TTS process. Speech database units are phones in different segmental and prosodic contexts. Pitch synchronous segmentation and labeling of databases allows storing both segmental and prosodic information. Phonetic-prosodic annotations of speech databases are involved in off-line training ...
متن کاملSLAM: Automatic Stylization and Labelling of Speech Melody
This paper presents SLAM : a simple method for the automatic Stylization and LAbelling of speech Melody. This main contributions over existing methods are : the alphabet of melodic contours is fully data-driven, an explicit time-frequency representation is used to derive complex melodic contours, and melodic contours can be determined over arbitrary prosodic/syntactic units. Additionally, the s...
متن کاملAutomatic recognition of intonation from F0 contours using the rise/fall/connection model
This paper describes an automatic system for labelling intonational tune information based on the Rise/Fall/Connection model of intonation. The system is powerful in that it presupposes no prosodic knowledge of the utterance it is recognizing, and is capable of labelling all the intonational tune eeects of English.
متن کامل